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Abstract 

Correlated proportions arise in longitudinal (panel) studies. A typical ex- 
ample is the "opinion swing" problem: "Has the proportion of people favoring 
a politician changed after his recent speech to the nation on TV?". Since the 
same group of individuals is interviewed before and after the speech, the two 
proportions are correlated. A natural null hypothesis to be tested is whether 
the corresponding population proportions are equal. A standard Bayesian ap- 
proach to this problem has already been considered in the literature, based on a 



Dirichlet prior for the cell-probabilities of the underlying two-by-two table un- 
der the alternative hypothesis, together with an induced prior under the null. 
In lack of specific prior information, a diffuse (e.g. uniform) distribution may be 
used. We claim that this approach is not satisfactory, since in a testing prob- 
lem one should make sure that the prior under the alternative be adequately 
centered around the region specified by the null, in order to obtain a fair com- 
parison between the two hypotheses. Following an intrinsic prior methodology, 
we develop two strategies for the construction of a collection of objective pri- 
ors increasingly peaked around the null. We provide a simple interpretation of 
their structure in terms of weighted imaginary sample scenarios. We illustrate 
our method by means of three examples, carrying out sensitivity analysis and 
providing comparison with existing results. 

Keywords: Bayes factor; Bayesian robustness, Dirichlet prior; marginal homo- 
geneity; matched pair; objective Bayes. 

1 Introduction 

Panel data are often used to investigate changes in opinion or behavior. For instance, 
suppose the same group of individuals is interviewed about their support for the 
President before and after the State of the Union Address. A natural issue is whether 
the speech has generated or not a net change in attitude. If the answer can only be 
"I support the President" or "I do not support the President", i.e. it is binary, the 
problem reduces to the comparison of two proportions in the population: of those 
supporting the President before and after the address. We say that no net change 
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has occurred if the two proportions are equal. Clearly, the two sample proportions 
are correlated, since the same group of people is involved in the two measurements. 
A Multinomial model involving the four pairs of outcomes "(Support, Support)", 
"(Support, Not Support)" , "(Not Support, Support)" , "(Not Support, Not Support)" 
is the appropriate sampling scheme to be used. 

A classic frequentist test dates back to McNemar (1947). The first Bayesian anal- 
ysis is attributed to Altham (1971), while more recent contributions include Broemel- 
ing & Gregurich (1996), Irony et al. (2000) and Kateri et al. (2001). In particular, 
the latter two papers compute the Bayes Factor (BF) for the hypothesis of no net 
change versus the alternative that a net change has occurred, starting from a "de- 
fault" Dirichlet prior on the unconstrained cell-probabilities. In lack of specific prior 
information, a diffuse (e.g. uniform) distribution may be used. The prior distribu- 
tion under the hypothesis of no net change is then induced from the Dirichlet prior. 
We call this a "default" BF for testing the equahty of two correlated proportions, 
and claim that it is not satisfactory, since it might unduly favor the null hypothesis. 
To overcome this drawback, we keep the same prior under the null and construct 
a suitable intrinsic prior (Berger & Pericchi, 1996) under the alternative, using an 
objective approach along the hues recently described by Casella & Moreno (2006, 
2007). The intrinsic prior is centered around the region where the null obtains, and 
the corresponding BF results in a fairer comparison between the two hypotheses. As 
the amount of centering is controlled by a scalar hyper-parameter, we actually have 
a family of prior distributions. We suggest studying BF sensitivity to prior choice 
within this family. 
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The rest of the paper is organized as follows. In Section 2 we give a new derivation 
of the above default BF for testing the equality of two correlated proportions; in 
Section 3 we discuss the concept of an intrinsic prior for testing hypotheses, and 
derive intrinsic priors for the problem at hand; in Section 4 we illustrate our method 
by means of three examples. Finally, Section 5 contains a brief discussion. 



2 Default approach 

We assume the data consist of observations on a pair of binary variables, {U, V) 
say, each taking values in {0, 1}, with n++ known by design. To fix ideas, let U — 1 
{V = 1) represent the opinion "I support the President", and U — {V = Q) 
the opinion "I do not support the President" , before (after) the State of the Union 
Address. A swing occurs whenever {U = 0,V = 1) or {U = 1,V = 0). We can 
arrange the four counts in a 2 x 2 table n — (noo, noi, nio, nn), where riQo + noi + nio + 
rill — ^++- If the data are independent, conditionally on some common parameter, 
the statistical model for the data is quadrinomial with matrix of cell-probabilities 
— {'^00, TToi, TTio, TTii), whcrc TToo + ttqi + TTio + TTn — 1. If no net change has occurred, 
the marginal distributions of U and V are the same, so that the problem under 
investigation consists in testing marginal homogeneity in a two-way contingency table, 
i.e. 7ro+ = TT+o, where subscript "-I-" denotes summation over the replaced suffix. In 
a 2 X 2 table, however, marginal homogeneity reduces to symmetry: ttqi = ttiq. 

We take advantage of the following reparametrization: rj — noi + ttio, 6 — 
and 7 = ^^^^^^^ ■ Notice that r] is the (unconditional) probability of observing a swing, 
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whereas 9 is the conditional probabihty of a swing from to 1, given that a swing 
occurs, and 7 is the conditional probability of no support for the President, given 
that no swing occurs. The sampling distribution of n can thus be written as 

/(n|?7,6',7) = 5'(nA|r^)/i(noi|nA,6')A;(noo|nA,7), (1) 

writing riA = Uqi + Uiq for the sum of the off-diagonal counts and letting g{n^\r]) — 
Bin(nA|n++,?7), /i(noi|nA, 6*) = Bin(noi|nA, 6^) and A;(noo|nA,7) = Bin(noo|n++ - 
^A,7), where Bin(y|m, ■0) is the Binomial probability function with m trials, and 
probability of success if), evalutated at y. 

With reference to the (ry, ^, 7)-parametrization, the null hypothesis of no net 
change can be formulated &s Hq : 9 — whereas the alternative hypothesis is 
H : 9 ^ \. Notice that rj and 7 are nuisance parameters, and that the parameter of 
interest 9 only appears in /i(noi|nA, the conditional distribution of noi given n^. 
Accordingly, Irony et al. (2000) base their analysis exclusively on the partial sampling 
model /i(noi|nA, 6*). This is also true for the classical test by McNemar (1947), which 
only makes use of the off-diagonal individual swing-counts (noi, tt-io), and for Altham 
(1971), who performs Bayesian inference on the difference ttqi — ttiq. However, this 
issue is controversial, as some Authors suggest that the whole sample size n++ should 
also be taken into consideration; see for instance Agresti (2002, p. 442). Therefore, in 
the following, we consider the full sampling model (1). Eventually, neither g{n/s\vi) 
nor A;(noo|?T'A, 7) will play a role in the default approch, but g{n^\r]) will do in the in- 
trinsic approach. We shall return to this issue later on in the paper, when considering 
the specification of an intrinsic prior, and in the Discussion. 
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As in previous Bayesian analyses of this problem, we assign a Dirichlet prior on 
the matrix of cell-probabilities tt. We do this for convenience, and to ease comparisons 
with existing results. In this way, we shall obtain a default BF for testing the equality 
of two correlated proportions. Later on, however, we shall significantly depart from 
previous works on this topic, because the prior under the alternative hypothesis will 
be suitably adjusted to the testing problem under consideration. Next lemma shows 
that the resulting prior on {r), 9, 7) is a product of independent beta distributions, 
shedding light on the implications of choosing a Dirichlet prior for tt. 

Lemma 2.1 Let tt ~ Dir{a), where a = (000,001,010,011) is a two-by-two matrix of 
(strictly) positive values. Then: i) rj Beta{aoi+aio, ooo+on); ii) 6 ~ Beta{aoi, oio); 
Hi) 7 ~ Beta{aoQ, on); iv) r], 9 and 7 are mutually independent. 

Proof. Substitute ttoo = 7(1 — 77), ttoi = 9rj and ttio = (1 — ^)r7 in the Dirichlet density 
of (ttoo, TToi, TTio), and multiply by Jacobian r){l — rj) to get the density of (77, ^,7). □ 
Recall that the BF of H versus Hq, or more briefly in favor of H, having ob- 
served n, is given by BFh,Ho{'it) — mn^in) ' "where mnin) and mHo{n) are the marginal 
probabilites of n under H and Hq, respectively. More specifically, we have mnin) — 
JJJ f{n\r],e,-f)pH{r],9,-f)dr]ded-f and mHo{n) = JJ f{n\r],9 = 1/2, -f)pHo{ri, 7) d^dj, 
where pniv^ ^, 7) is given by Lemma 2.1 and PHoiv^ l) has yet to be specified. If we 
derive it from Ph{Vj ^1 1) i) the marginal Ph{Vj l) or ii) the conditional Ph{Vj1\(^ — 
1/2) we obtain the same result: a product of two independent beta densities. Using 
this induced prior on Hq, the following lemma produces a default BF against the 
equality of two correlated proportions. 
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Lemma 2.2 If n Dir{a), then a default BF in favor of H is given by 

2"^S(aoi +noi,aio + nio) 

where B{a,b) — /^^^""^(l — t^'^dt, a > 0, b > 0, is the Beta special function. 

Proof. Since the prior density of {ri, 7) under Hq corresponds to Ph{Vj = 1/2), the 
Savage density ratio apphes; see e.g. O'Hagan & Forster (2004, Sec. 7.16). This gives 
BFH,Ho{n) = , where Pi/(6'|n) is the posterior density of 9. Now, from ii) of 

Lemma 2.1, the prior of 9 is Beta(aoi, oio), whence ph{9 — 1/2) — ^^^^^ ^^^^ Q-^aoi+aio. 
on the other hand, using (1) and the independence of 77, 6* and 7, the posterior density 
of 9 is easily seen to be Beta(aoi + noi, aio + nio). The result follows. □ 
Lemma 2.2 is also derived in Kateri et al. (2001), by direct computation of mij(n) 
and ruHoin), and in Irony et al. (2000), using the conditional model /i(noi|nA, 9) in (1). 



3 Intrinsic approach 
3.1 General 

The BF is notoriously sensitive to prior specifications. This suggests that the choice 
of prior distributions should be especially thoughtful. In particular, we would like to 
consider one specific aspect of prior specification which is often overlooked. When 
testing Hq versus H, the Hq hypothesis must have some reasonable grounds, otherwise 
we would not have considered it in the first place. Assuming that Hq is nested in H, 
this suggests that the prior under H should be centered around the region in which 
Hq obtains. Otherwise, when data are just in reasonable accord with Hq, the prior 
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under H is simply "wasting away" prior probability mass in regions that are too 
unlikely to be supported, and Hq would be unduly favored. This aspect is strictly 
related to the Jeffreys-Lindley's paradox; see Robert (2001, p. 234). Notice that 
centering the prior under H around values consistent with Hq does not lend support 
to Hq, but rather to H: a point which is often overlooked in Bayesian applications. 
The previous considerations are lucidly spelled out in Casella & Moreno (2007); see 
also Casella & Moreno (2006). We now briefly review their approach, showing some 
basic facts about intrinsic priors and the associated BFs. 

Let y be the actual data, and denote with x a corresponding set of "imaginary" 
data. We shall assume, for simplicity of exposition, that the data are discrete, as in the 
problem that motivates our paper. Let the sampling distribution of y be fniyl^H), 
under H, and fHoiul'^Ho), under Hq. Let the priors under both models be given, 
and denote them with ph{^h) and Pho{^Ho), respectively. We shall not insist on the 
nature of these priors, although they will typically be standard "estimation-based" 
priors. In this paper, we shall restrict our attention to proper priors; see Moreno 
et al. (1998) for an intrinsic limiting procedure dealing with improper priors. Finally, 
let niHix) = / fH{x\\H)PH{>^H)d\H and mHo{x) = / fHo{x\>^Ho)PHo{^Ho)d^Ho be the 
marginal probabilities of the imaginary data x under H and Hq, respectively. 

Definition 3.1 The intrinsic prior for \h conditionally on Hq is given by 

p'h{Xh\Hq) = ph{Xh)Ex, 

where BFh,Ho {x) is the BF of H versus Hq having observed the imaginary data x, 
and the expectation is taken with respect to the sampling distribution fnixlXn)- 
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mHo[x) 
rriHix) 



= Ph{Xh)Ex„ [BFh,Ho{x)-'], 



Proposition 3.1 If pn o-nd pn^ are proper, the intrinsic prior p^jj is also proper. 



Proof. Interchanging the sum over x and the integral with respect to Xr, we find 



PH{XH\Ho)dXH = Yl I fxMXH)PH{.XH)dXH 



1. 



□ 



Proposition 3.2 The intrinsic prior is equivariant to reparametrization: if Xh^i^h, 
then Pff"^" {iPh\Ho) — P^h"^ {Xh{'>Ph)\Hq) x \ J\h{''Ph)\i where J\u{tpH) is the Jacobian. 

Proof. Expressing the samphng distribution of y in terms of Xh, we find 



mHo[x) 
rriHix) 



and the thesis follows. 



□ 



The two propositions above show that Definition 3.1 is well-posed. Note, however, 
that the intrinsic prior depends on the sampling distribution of x, and in particular 
on its sample size (prior sample size). As for the interpretation of Definition 3.1, the 
following proposition is enlightening. 



Proposition 3.3 The intrinsic prior is a mixture of imaginary posteriors: 



Ph{Xh\Ho) = y^^pH{XH\x)mHo{x) 



(3) 



Proof. By direct computation, we find 



Ph{Xh\Hq) ^Ph{Xh)Exh 



mHo[x) 
mnix) 



^PH{XH)fH{x\XH) 



and (3) follows. 



□ 



9 



Proposition 3.3 shows that the intrinsic prior will be more dominated by posteriors 
corresponding to imaginary realizations that are more supported under Hq, and this 
explains why it will be more concentrated around values consistent with Hq than 
the original prior Ph{Xh)- The extent of this concentration will depend both on 
the structure of Ph{Xh) and on prior sample size. Notice that Ph{Xh\Ho) is an 
example of expected posterior prior (with base model Hq) as defined by Perez & 
Berger (2002). As a consequence of Proposition 3.3, intrinsic prior BFs will also be 
mixtures of conditional BFs, over all possible configurations of the imaginary data x, 
with weights given by the marginal distribution of x under Hq. 

Definition 3.2 For data y, the intrinsic prior BF in favor of H is given by 

where m^ly) — J /ij(y|Ai^)p^(AH|i7o) is the intrinsic marginal probability ofy. 
Proposition 3.4 For data y, the intrinsic prior BF in favor of H is given by 

BFIj^hM = ^BFH,HMx)'n^H^{x), 

X 

where BFhhMx) = S fH{y\XH)vH{XH\x)dXH ^ m«(^_ 

Proof. Substituting (3) in (4), then interchanging the sum and the integral, we obtain 

BPff^Hoiv) = / fH{yM^PHiXH\x)mHo{x)dXH 

sr^ , .J fH{y\XH)pH{XH\x)dXH 

and the thesis follows. □ 
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3.2 Conditionally-Intrinsic Prior and Test 

The default BF of Lemma 2.2 for testing the equahty of two correlated proportions 
was obtained in the full sampling model (1), although it can also be derived using 
the conditional model h{noi\nA,0). The following argument provides a Bayesian 
justification for basing inference solely on the conditional model. 

Since t] and 7 are nuisance parameters in the testing problem under consideration, 
a natural suggestion is first to integrate them out, thus obtaining an "integrated 
model" which only depends on the parameter of interest 9, then to proceed with 
the testing problem. Consider first the alternative hypothesis H. Using (1) and the 
independence of 77, 9 and 7, we immediately obtain that the integrated model under H 
is given by 

f^{n\9) = mH{nA)mHinoo\nA)h{noi\nA, 9), (5) 

where m^lnA) = / g{nA\r))PH{v)dr) and mH(noo|nA) = / k{noo\nA,j)PH{j)dj. Con- 
sider next the null hypothesis Hq, whose sampling model is obtained from (1) by 
setting 9 — 1/2. To integrate out r} and 7 we need a prior Pi/o(ry,7). Since they 
are nuisance parameters, one possible argument is that this prior should be the 
same as that under H; see e.g. O'Hagan & Forster (2004, Sects. 11.29-11.33) in a 
different context. In other words PHoiVjl) should be obtained from Ph^'Hi^^iI) by 
marginalization. Alternatively, one could think of obtaining PHoiVjl) by condition- 
ing, i.e. PHoi'H-il) — Ph{Vt1\^ — 1/2)- In either case we obtain the same result, 
because of the independence of 77, 9 and 7, so that the integrated model under Hq is 

/^oW mH{nA)mH{noo\nA)h{noi\9 = 1/2, ua) , 
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where mnini^ and mH{nQo\n/^ are the same as in (5). Prom the integrated models 
fffinlO) and (n) one can now compute the BF, by taking the ratio ^ fn^'^f)PH{^)'^^ _ 

JHq v^) 

We obtain fe^(^)||j\^[^'^'^g=]^/2j'^^ ) factors mH{n/\) and mij(noo|?^A) clearly cancel out. 
This is the default BF based on the conditional sampling model /i(noi|nA, 0)- Hence, 
provided we are willing to take the same marginal prior for (77,7) under H and Hq, 
we can forget about g{n^\r]) and A;(noo|?T'A, 7) in (1)- 

In the following of this subsection, we apply the intrinsic prior methodology based 
on the conditional sampling model h{nQi\n/\,6): the resulting prior will be called 
"Conditionally- Intrinsic" (CI). Both its density and the corresponding BF will be 
labeled with a superscript "CI" ; moreover, for the sake of clarity, other intermediate 
quantities will have the superscript "Co" to remind the reader that they all refer to 
the above Conditional model. The general notation of Subsection 3.1 will be reserved 
for the next subsection, wherein the full sampling model (1) will be employed. Note 
that, within the CI- procedure, the imaginary data x are represented by (xoi,xa), 
where = Xqi + Xio, or equivalently by (a;oi,a;io). 

We start by defining the Cl-prior, which is given by 



wnere nfjjjj^yx) - h{xoi\xr:.,e=i/2) ■ 



Proposition 3.5 The Cl-prior for 6 conditionally on Hq is given by 

Ph\0\Hq)= r^^J fij Beta{9\aoi + Xoi,aA- aoi + XA- xoi), 

where Beta{9\A, B) is the Beta density with parameters A and B, evaluated at 6. 
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Conditional-Intrinsic Priors 




Figure 1: Conditional-intrinsic prior densities, for ten different prior sample sizes, 
starting from a uniform distribution on the cell-probabilities. 

Proof. The general structure of the intrinsic prior is given in (3). It is immediate 
to check that p'h'{0\x) is actually 'Qeta{9\aoi -l-xoi, oa — clqi +x/\, —xqi). On the other 
hand, it holds that m%{x) = /i(xoi|xa, 9^1/2)= (^^^J {\y^ . □ 

Proposition 3.5 shows that the Cl-prior for ^ is a finite mixture of Beta distribu- 
tions with + 1 components, where xa is the prior swing count. Figure 1 plots the 
Cl-prior for 6, starting from a uniform distribution on tt, for ten different prior sample 
sizes (values of xa)- We include the zero sample size, which gives back the uniform 
default prior for 6. Notice how the Cl-prior gets more and more peaked around the 
null value ^ = 1/2, as prior sample size grows. 
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We now turn to the derivation of the Cl-prior BF, namely 

where m%l{n) = h{noi\n/s,,9 = 1/2) and m%\n) = J h{nm\n/x,,9)p%^ {9\Ho)d9. 
Proposition 3.6 The Cl-prior BF in favor of H is given by 

^01=0 / \ / 

where BF^° in\x) — 2"^ '^^""^ +xoi+noi -api +x a -xqi +nA -noi ) 

Proof. Prom Proposition 3.4 we immediately derive that the Cl-prior BF is a mixture 
of conditional BFs. The weights of the mixture are exactly those of Proposition 3.5. 
On the other hand BFgX{n\x) = ^ ''tlnlXifi%^'' ^ ^i^h ph{9\x) a Beta(^; aoi + 
3^01) — + xa — Xqi) distribution. Standard calculations lead to the result. □ 



3.3 Intrinsic Prior and Test 

In this subsection we work in the full sampling model (1), involving all three param- 
eters r], 9 and 7, and derive the corresponding intrinsic prior. This is defined as 

Ph{v,^,i\Ho) = pH{v,On)Er,,e,^[BFH,Ho{x)~^] , 

according to Definition 3.1, where BFh,Ho{x) is given by (2) with x in place of n. Since 
77, 9 and 7 are independent, and BFh^h^^x) does not depend on xqq, it turns out that 
p^(77,e,7|//o) = PH{l)p'HiV:0\Ho), where p'h{V:0\Ho) = pH{V:0)E„,e[BFH,Ho{x)-']. 
In other words, under the I- prior 7 is still independent of [r], 9), and its marginal is 
unchanged. As a consequence, parameter 7 and data Xqq play no role and can be 
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disregarded. In the following we shall focus our attention on parameters rj and 9, 
and imaginary data x will be represented by the triple (xqi, Xiq, x+^), or equivalently 
by {xoi,xa,x++), where a;++ is the prior sample size. 

Proposition 3.7 The intrinsic prior for {r),9) conditionally on Hq is given by 
PH{r],9\Ho) = X] X] Beta{r]\aA + XA,a++-aA + x++-XA) 

a;oi=0 a;A=a;oi 

X Beta{9\aoi + xoi,aA — aoi + xa — xoi) 

^ / xa\ /I V'^ / x++\ B{aA + XA,a++ - Qa + x++ - xa) 
\X01J\2J \xa J B{aA,a++-aA) 

Proof. Prom (3) we know that Pnir}, 9\Ho) — ^^Ph{V: 9\x)mHo{x). Recalling (1) and 

the distributional results of Lemma 2.1, the expression for pH{r],9\x) is easily seen to 

be Ph{Vj 9\x) = Beta{ri\aA + xa, a++ — oa + 3:++ — xa) x Beta{9\aoi +xoi, a a — 0,01 + 

^^A — 3^01) SO that, conditionally on x, rj and 9 are independent. On the other hand, 

the weight mHo{x) is given by 

rriHoix) = j g{xA\r])h{xoi\xA,9 = l/2)pHoiri)dv 

^ {lo){lJ\f [''^^)v''H^-Vr^^-''''Beta{v\aA,a++-aA)dv 

_ /xa\ / 1 V'^ / x++\ B{aA + Xa, a++ - Qa + x++ - Xa) 
\X01J\2J \xa J B{aA,a++-aA) 

and the thesis follows. □ 

Notice that the prior pHoiv) is set equal to PHiv), the marginal prior for rj 
under H, in agreement with the choice that was made in Subsection 3.2. In this way 
we shall be able to make more sensible comparisons with the Cl-procedure. 

Figure 2 presents the default distribution of [r], 9) corresponding to the uniform 
distribution of tt, together with two I-priors having different prior sample sizes, namely 
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= 10 and x++ = 50, to show the increasing concentration of the joint distribution 
around the 9 = 1/2 hne. An interesting feature is represented by the "pear" -shaped 
contour hues depicted in the bottom-right paneh they are all symmetric around the 
line 6 = 1/2, and they pile up around it as the prior sample size increases, but the 
piling is not uniform with respect to r}. More precisely, the range of credible values 
for 9 (effective support) becomes smaller as the value of r] increases. This shows a 
dependence structure of the two parameters under the I-prior, in contrast with the 
independence between 77 and 9 under the default prior. Figure 3 makes this point 
explicit by plotting the contour lines of the joint distribution of (77, 9) under the I- 
prior, based on a;++ = 20 prior observations, and those (represented by ellipsoids) of 
a prior having the same marginal distributions, but embodying independence of the 
two parameters. 

Recall that in the Cl-approach parameter 77 was integrated out at the very begin- 
ning, using the same prior under H and Hq, since 77 is a nuisance parameter. One 
could argue that the intrinsic approach should implicitly recognize that is a nui- 
sance parameter. In particular, centering the marginal prior for 77 around Hq makes 
no sense, so that the marginal prior for 77 should remain unaffected by the intrinsic 
procedure. This conjecture turns out to be true, as the following proposition shows. 

Proposition 3.8 The marginal distribution ofrj under the intrinsic prior is the same 
as the original marginal prior, i.e. p^(77|i?o) —Ph{v)- 

Proof. Prom the expression of p^(77|i7o) in Proposition 3.7 we derive 

X++ X + +—X01 

PnivlHo) = XI XI Beta{rj\aA + XA,a++- aA + x++- xa) 

a;oi=0 i;io=0 
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Intrinsic Prior Density Surface 



Intrinsic Prior Density Surface 




Prior sample size equal to Prior sample size equal to 10 

Intrinsic Prior Density Surface Intrinsic Prior Contour Lines 




0.0 0.2 0.4 0.6 0.8 1.0 

e 

Prior sample size equal to 50 



Figure 2: Default prior density (top-left), two diflFerent intrinsic prior densities (top- 
right and bottom- left), and their contour lines (bottom-right). 
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Intrinsic Prior Contour Lines 




Figure 3: Contour lines for the intrinsic prior density with = 20, compared to 
those of the bivariate density having the same marginals with rj and 9 independent. 

B{aA + a;oi + xio, a++ - ua + x++ - xqi - xio) 
B{aA: «++ - «a) 

X BetalrjlaA, a++ — oa) 

□ 

On the other hand, the Cl-prior for 9 cannot be recovered (exactly) within the 
I-procedure, as shown in Figure 4 for a selection of prior sample sizes, best 
approximating the Cl-prior based on xa — 20. The marginal distribution of pa- 
rameter 9 under the I-procedure is different from the corresponding prior under the 
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Marginal Intrinsic Priors 




Figure 4: Marginal intrinsic prior densities for 6 best approximating the Cl-prior 
density based on = 20 (dashed hne). 

Cl-procedure. The reason hes in the different structure of the imaginary data x: 
under the Cl-procedure we have x = (xqijXa), while under the I-procedure we have 
^ — {xqi-iXa-iX++)- This gives rise to different weights: compare mnix) in the proof 
of Proposition 3.7 to m^{x) in the proof of Proposition 3.5. 

Let us now consider the expression for the I-prior BF. From Proposition 3.4 we 
immediately derive that this is a mixture of conditional BFs, the weights of the 
mixture being the same as in the proof of Proposition 3.7. Hence, BF^jj^{n) — 
'^^BFh^Ho{'>^\^)^Ho{^)j where we need to compute 

. I X I Uj{'iA\ilV>{'iin\nA-0)pii{il-^U-)dnde 

Bi'H,Ho[nx) = . {b) 

niHoin) 
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The denominator of (6) is the same as niHoix) with x replaced by n. On the other 
hand, the numerator of (6) is equal to 

^n++\ i?(aA + + ^A, a++ - oa + - xa + n++ - n^) 



n/\_ J B{aA + a++ — oa + x++ — xa) 

nA\ B{aoi + xoi + noi, oa - ooi + xa- xqi + tia - rioi) 



^noij B{aoi + xoi,aA - aoi + xa - xoi) 

so that we obtain 

B(aoi + xoi + noi, oa - Qoi + xa- xoi + riA - n-oi) 



BFH,Ho{n\x)mHo{x) = 2"^ 



X 



X 



i?(aoi + Xqi, aA - ooi + a^A - Xoi) 
B{aA + xa + nA, a++ — ca + — xa + n++ — ua) 
B{aA + ^A, a++ - Oa + n++ - wa) 



2^^ V^oi a^io - xa) ) ' 
One can recognize that the first factor above is exactly BF^\J(n\x\ as given in 
Proposition 3.6. Therefore, multiplying and dividing by 2^++, we get 



BFji^Hoi^) (yxoiXw{x++ - xoi - xio)) 



X 2^++ 



^a;oia;io(x++ - xqi - xio)J 2^^+^++ 
B{aA + xa + nA, a++ — ca + x++ — xa + n++ — ua) 



B{aA + Ua, a++ - oa + n++ - tia) 



Curiously, the second line in the above expression can be further recognized as a 
particular BF of the hypothesis rj ^ versus the alternative f] = \, relative to 
the marginal binomial sampling model g, having observed the imaginary data x, and 
assuming a Beta(aA + ?T'A, o++ — oa + ?T'++ — ?T'a) "prior" distribution on 77. By writing 
this Bayes factor as BF^i (a;|n), and suitably re-writing the factor ^x^+x++ , we 
get to the following proposition. 



20 



Proposition 3.9 The I-prior BF in favor of H is given by 



X++ W 1 \ f 1 \ / 1 
xoi xiQ X++ - xa) v4 



2^01 /l \ ^10 /l \ X++-XA 



XQl X\o 

X BF^^^.. {x\n) X BF^^^M^l 



where BF^"rr (nix) = '""^ ^"".f "^T a™<^ ^F^Ii i( 



_x\n} 

2''l~2 



<2x++ -B(aA +a:A+nA)a++— "A+^++~^A+n++— wa) 
B(aA +nA — OA +n++ — wa) 



Hence, BFjj j^^ (n) is a mixture of products of two BFs: the CI- prior BF derived 
in the previous section, and the marginal BF of 77 7^ | versus f] — \ using imaginary 
data and an actual posterior distribution on rj. 



4 Examples 

We illustrate our methodology using three examples. They were chosen by Irony 
et al. (2000) in order to reflect agreement or disagreement on testing Hq versus H 
between a frequentist approach (p-value) and a Bayesian approach (default BF of 
Lemma 2.2 derived from a uniform prior on the vector of cell-probabilities tt). The 
analysis carried out in Irony et al. (2000) also involves estimating 9, as well as other 
parameters, but we shall not report on it. Conventionally, Irony et al. (2000) reject 
the null when the p-value is below 5%, in the frequentist mode, and when the default 
BF against the null is above 1, in the Bayesian mode. 

We shall provide an alternative Bayesian analysis based on our intrinsic prior 
methodology, also starting from a uniform prior on the cell probabilities. For ease 
of communication, we transform the BF to the probability scale of Hq, assuming 
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prior odds equal to one, i.e. we consider Pr{HQ\n) — i^^p^ ^ and similarly for 
Fr^\Ho\n) and Fr\Ho\n). 

For each example, rather than selecting a single intrinsic prior, we shall assess the 
sensitivity of our results to prior sample size. In the spirit of Fan & Berger (2000), 
this will be achieved through a plot of the posterior probability of Hq as a function 
of the ratio between prior and actual sample size, which we label q. Specifically, 
q — for the I-case, and q — x^/n^ for the Cl-case, < g < 1. 

We now present and discuss the three examples. Although the data are fictitious, 
for the sake of exposition they are presented as a survey of individuals expressing 
support ("Yes", "No") for the President, before and after a Presidential address. 

Example 4.1 The data for this case represent a random sample of 100 interviewed 
individuals, and are reported in Table 4..!. The p-value is 25% and Pr(ifol^T') = 0.64, 
so that neither approach would reject Hq. 

Example 4.2 In this example a random sample of 100 interviewed individuals gave 
rise to Table The p-value is A%, so that Hq would be rejected, while Y'r{HQ\n) — 
0.29, leading to the same conclusion. 

Example 4.3 In this last example the sample size is only 14 and the outcome of 
the survey is given by Table 4-3. The p-value is 7%, suggesting that Hq should not 
be rejected; on the other hand Pr{HQ\n) — 0.22, pointing to the opposite conclusion. 
Thus, for this example, there seems to be a clear conflict between the conclusions 
under the two approaches. 
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Support for the President 
After 

Before No Yes Total 
No 20 17 37 
Yes 10 53 63 
Total 30 70 100 

Table 1: Data for Example 1. 

Support for the President 
After 

Before No Yes Total 
No 20 21 41 
Yes 9 50 59 
Total 29 71 100 

Table 2: Data for Example 2. 

Support for the President 
After 

Before No Yes Total 
No 1 7 8 
Yes 1 5 6 
Total 2 12 14 

Table 3: Data for Example 3. 
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We now proceed with the intrinsic analysis of the three examples, having in mind 
that sensitivity to prior specifications is a major concern when evaluating BFs. Ac- 
cordingly, our results are visually summarized in Figure 4. For each example we 
present two panels: on the left-hand-side we plot the Cl-prior density for selected 
values of the fraction of prior to actual sample size q, namely q = 0, 0.25, 0.50, 0.75, 1. 
Clearly, q — corresponds to the starting default prior for 9, that is the uniform dis- 
tribution on the interval (0, 1). We also plot in this panel the normalized likelihood 
function. In the right panel we present the plots of Pr^\Ho\n) and Pr\Ho\n), as a 
function of q. Notice that for all three datasets the two plots are remarkably similar. 

Consider first Example 1. The left panel reveals a mild confiict between the 
likelihood and the null hypothesis, as well as the Cl-prior which by construction is 
centered on Hq. The right panel shows that Fr'" \Ho\n) and Fr\Ho\n) are decreasing 
for < g < 1, spanning a range from 0.64 (default value) to 0.47. This can be ex- 
plained as follows: the data do not contradict explicitly Hq and thus there is scope for 
diminishing the probability of Hq by peaking the prior under H around Hq. Although 
the curves cross the 50% threshold, this only occurs if q exceeds 70%: a value that 
appears to overemphasize the role of the prior. In conclusion, the rejection of Hq on 
conventional grounds is not totally robust, but is broadly justified. 

Consider now Example 2. The left panel reveals an appreciable conflict between 
the likelihood on the one hand, and the null and the Cl-prior on the other. This 
aspect translates into a posterior probability of Hq approximately ranging from 0.24 
to 0.29 (default value). Notice that in this example the behavior of Fr'"\HQ\n) and 
Fr^{HQ\n) is not monotone as a function of q. To understand why this occurs, recall 
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Cl-Priors and Normalized Likelihood for Example 1 



Posterior Probability Plot for Example 1 



likelihood 
q = 
q = 0.25 
q = 0.5 
q-0.75 
q = 1 
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Cl-Priors and Normalized Likelihood for Example 2 



Posterior Probability Plot for Example 2 
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Cl-Priors and Normalized Likelihood for Example 3 



Posterior Probability Plot for Example 3 
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Figure 5: Intrinsic prior and posterior probability of Hq. sensitivity to prior sample 
size for each of the three examples {q is the fraction of prior to actual sample size). 
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that the data are in disagreement with the null. As a consequence, centering the prior 
under H around the null can only benefit the posterior probability of Hq provided the 
prior sample size is not too large; otherwise the prior under H will be too similar to 
that under Hq. In conclusion, the conventional rejection of Hq appears fully robust; 
yet the default value of 'Px{HQ\n) is possibly too high, since a moderate prior sample 
size determines a relatively sharp decrease in {HQ\n) and Y*T\HQ\n). A value 
close to the 0.25 threshold seems therefore more reasonable. 

Finally, turn to Example 3. Here the likelihood reveals a marked disagreement 
with the null, although it has a rather heavy left tail because of the limited number 
of observations. The family of Cl-priors is accordingly more dispersed around the 
value 6 — 1/2, because they are based on prior sample sizes which are a fraction 
of = 8. Notice that Pr^'^(i7o|w) and Pr^(i7o|?^) are now monotone increasing 
from 0.22 (default value) to 0.26. This is a clear example where the intrinsic prior 
methodology contributes to water down the evidence against the null, thus defying 
its original purpose. This happens because the data are already in good agreement 
with the alternative, and thus peaking the prior under H around Hq only accumulates 
probability mass in areas not supported by the likelihood, thus increasing the posterior 
probability of Hq. Hence, in this case too, the conventional rejection of Hq is robust. 

5 Discussion 

In this paper we considered Bayesian hypothesis testing for the comparison of two 
nested hypotheses, using the BF as a measure of evidence. Alternative approaches 
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may be useful, e.g. those based on measures of divergence such as the symmetrized 
KuUback-Leibler; see Bernardo (2005). However, when converted to the probabihty 
scale of the null, assuming prior odds equal to one, the BF is a communication 
device of unsurpassed clarity. However, since the BF is strongly influenced by prior 
specification, whose effect does not decrease as the sample size increases, sensitivity 
analysis is needed to evalutate robustness of the conclusions. 

An objective framework for sensitivity analysis is offered by the intrinsic prior 
methodology: starting from default priors both under the null and the alternative 
(encompassing) hypothesis, a family of prior distributions under the alternative is 
constructed such that its elements are increasingly concentrated around the null, as 
prior sample size grows. In this way, we avoid to unduly favor the null, especially 
when the data do not contradict it explicitely 

For the problem of testing the equality of two correlated proportions, there is 
a choice between using the full sampling model (1) and the conditional sampling 
model given the overall number of swing. We considered both approaches, obtaining 
I-priors and Cl-priors, respectively. When applied to the examples of Section 4, the 
two approaches proved to be essentially equivalent, altough in general they are not. 
In particular, as shown in Figure 6, the posterior probability of the null depends on 
the total sample size n++ in the I-approach only; see Ghosh et al. (2000) for similar 
results obtained using a Rash-type Bayesian hierarchical model. 

A point that was not addressed in this paper concerns consistency of the Cl-prior 
and I-prior Bayes factors. Casella & Moreno (2006), discussing tests of independence 
in two-way contingency tables, show that the intrinsic BF is consistent provided the 
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Posterior Probability Plot 




Figure 6: Posterior probability of the null, as a function of n++, for a table n such 
that noi — riiQ — 5, with x++ = 10 and xa — ^ for I-priors and Cl-priors, respectively. 

ratio of prior to (actual) sample size goes to zero as the sample size goes to infinity. 
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Finally, the problem of comparing two correlated proportions is a special case of 
the more general problem of testing the hypothesis of marginal homogeneity in square 
contingency tables; see Bergsma (1997, Ch. 3) for an extensive review. 
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